MultPIM: Fast Stateful Multiplication for Processing-in-Memory


Abstract

Processing-in-memory (PIM) seeks to eliminate computation/memory data transfer using devices that support both storage and logic. Stateful logic techniques such as IMPLY, MAGIC and FELIX can perform logic gates within memristive crossbar arrays with massive parallelism. Multiplication via stateful logic is an active field of research due to the wide implications. Recently, RIME has become the state-of-the-art algorithm for stateful single-row multiplication by using memristive partitions, reducing the latency of the previous state of the art by 5.1×. In this paper, we begin by proposing novel partition-based computation techniques for broadcasting and shifting data. Then, we design an in-memory multiplication algorithm based on the carry-save add-shift (CSAS) technique. Finally, we develop a novel stateful full-adder that significantly improves the state-of-the-art (FELIX) design. These contributions constitute MultPIM, a multiplier that reduces time complexity from quadratic to linear-log. For 32-bit numbers, MultPIM improves latency by an additional 4.2× over RIME, while even slightly reducing area overhead. Furthermore, we optimize MultPIM for full-precision matrix-vector multiplication and improve latency by 25.5× over FloatPIM matrix-vector multiplication.
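The carry-save add-shift (CSAS) technique named in the abstract can be illustrated with a small software model. The sketch below is only a bit-level simulation of the arithmetic idea, assuming unsigned operands; it does not model memristive partitions, broadcasting, or the gate sequence used by MultPIM. The names `full_adder` and `csas_multiply` are hypothetical and chosen for illustration.

```python
def full_adder(x, y, z):
    """One-bit full adder: returns (sum, carry) for three input bits."""
    s = x ^ y ^ z
    c = (x & y) | (x & z) | (y & z)
    return s, c


def csas_multiply(a, b, n):
    """Multiply two unsigned n-bit integers with carry-save add-shift.

    Illustrative model only: n full-adder units each keep one sum bit and
    one carry bit. Every step adds the partial product a * b_bit to the
    right-shifted sums, and emits sums[0] as the next product bit.
    """
    a_bits = [(a >> i) & 1 for i in range(n)]
    sums = [0] * n
    carries = [0] * n
    product = 0

    # n steps feed the bits of b (LSB first); n more steps flush the state.
    for step in range(2 * n):
        b_bit = (b >> step) & 1 if step < n else 0
        new_sums = [0] * n
        new_carries = [0] * n
        for j in range(n):
            partial = a_bits[j] & b_bit
            # The sum bit of the unit to the left is shifted in.
            shifted_in = sums[j + 1] if j + 1 < n else 0
            new_sums[j], new_carries[j] = full_adder(
                partial, shifted_in, carries[j]
            )
        sums, carries = new_sums, new_carries
        product |= sums[0] << step  # lowest sum bit is final
    return product
```

In this model the inner loop over `j` has no dependency between units within a step, which is the property that lets carries be saved locally rather than propagated across the whole word on every addition.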


Similar resources

An Efficient LUT Design on FPGA for Memory-Based Multiplication

An efficient lookup table (LUT) design for a memory-based multiplier is proposed. This multiplier is suited to DSP computation where one of the multiplier's inputs, the filter coefficient, is fixed. In this design, all possible products of the input multiplicand with the fixed coefficient are stored directly in memory. In contrast to an earlier proposition Odd Multiple Storage ...
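The basic idea of storing every product of the input with a fixed coefficient can be sketched as a plain table lookup. This is a minimal illustration assuming an unsigned input of a known bit width; it is not the optimized design the abstract goes on to describe, and `build_product_lut` and `lut_multiply` are hypothetical names.

```python
def build_product_lut(coefficient, input_bits):
    """Precompute coefficient * x for every unsigned input of input_bits bits."""
    return [coefficient * x for x in range(1 << input_bits)]


def lut_multiply(lut, x):
    """Multiply by the fixed coefficient with a single memory read."""
    return lut[x]


# Example: a fixed filter coefficient of 7 and 4-bit inputs.
lut = build_product_lut(7, 4)
result = lut_multiply(lut, 5)
```

The table holds 2^input_bits entries, so the memory cost doubles with each additional input bit.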


Stateful Mobile Modules for Robust In-network Processing

Most sensor-network applications are dominated by the acquisition of sensor values. Due to energy limitations and high energy costs of communication, in-network processing has been proposed as a means to reduce data transfers. As application demands may change over time and nodes run low on energy, get overloaded, or simply face debasing communication capabilities, runtime adaptation is require...


Multiplication in curvature processing.

Multiplication rather than addition of neural signals is believed to underpin a variety of sensory processes, yet the evidence for multiplication is rare. Here we provide psychophysical evidence for neural multiplication in human visual processing of shape. We show that the curvature of a contour is likely detected by a mechanism that multiplies rather than adds the signals from afferent sub-un...


Stateful Scalable Stream Processing at LinkedIn

Distributed stream processing systems need to support stateful processing, recover quickly from failures to resume such processing, and reprocess an entire data stream quickly. We present Apache Samza, a distributed system for stateful and fault-tolerant stream processing. Samza utilizes a partitioned local state along with a low-overhead background changelog mechanism, allowing it to scale to ...


Fast Matrix Multiplication

We give an overview of the history of fast algorithms for matrix multiplication. Along the way, we look at some other fundamental problems in algebraic complexity like polynomial evaluation. This exposition is self-contained. To make it accessible to a broad audience, we only assume a minimal mathematical background: basic linear algebra, familiarity with polynomials in several variables over r...



Journal

Journal title: IEEE Transactions on Circuits and Systems II: Express Briefs

Year: 2022

ISSN: 1549-7747, 1558-3791

DOI: https://doi.org/10.1109/tcsii.2021.3118215